DETAILED ACTION
Response to Arguments 

Applicant's arguments with respect to claim 1 have been considered but are moot 
in view of the new ground(s) of rejection. 

Regarding claims 2-3, Vidovic does suggest a video linking system by explaining
a videotape recording apparatus. It is well known in the art that a videotape recording
apparatus can be considered a video linking system because it links video. For
example, a videotape recording apparatus may stop video in one frame, then
fast-forward and restart the video in another frame, thereby linking the video frames.

Regarding claims 4-5, Toklu does not explicitly disclose a video linking system, 
which generates linked video files. It should be noted that linking video segments could 
be interpreted in the art as joining video clips in a sequential or serial manner. Thus, 
Toklu does teach video summarization methods including segmenting a video into 
segments such as video "shots" and selecting one or more key-frames from the shots.
(Col 1, lines 34-37) It should be noted Rangan also teaches segmenting video to be
tracked, frame-by-frame. (Col 6, lines 39-44) Furthermore, it is well known in the art
that a key-frame (segment break) may be used to indicate the beginning or end of a
change made to the video signal.

In response to applicant's argument that the references fail to show certain 
features of applicant's invention, it is noted that the features upon which applicant relies 
(i.e., copyright owners of video content view the process of combining the data stream 
with the video content data stream as an alteration of video content and perhaps the 
video quality and as such copyright owners are more likely to permit playback of original
video content independent of data stream) are not recited in the rejected claim(s). 
Although the claims are interpreted in light of the specification, limitations from the 
specification are not read into the claims. See In re Van Geuns, 988 F.2d 1181, 26 
USPQ2d 1057 (Fed. Cir. 1993). 

Definition 

It should be noted the specification never explicitly discloses that linked video
files are played back independently while said video content is being played back
(claim 1). Nonetheless, upon further review of the disclosure, applicant recites in
regard to Fig. 1, "...the video hosting platform 38 is shown separate from playback
platform 43." (p. 7 paragraph 26, p. 19 paragraph 48) Thus, linked video files played
back independently while said video content is being played back can be interpreted,
with regard to the specification, as linked video files played back on a platform
separate from the platform utilized to play back video content. Therefore, this is the
interpretation taken by the examiner in the following rejections.

Claim Rejections - 35 USC § 103 
The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

Claim 1 is rejected under 35 U.S.C. 103(a) as being unpatentable over Rangan
(6198833) in view of Rhoads (6411725).

In claim 1, Rangan teaches an image processing system for processing video
content in a sequence of video frames and linking one or more pixel objects embedded 
in said video content to selected data objects in a sequence of video frames by 
explaining a system is provided for tracking a moving entity in a video presentation, the 
system comprising a computer station presenting the video presentation on a display 
as a series of bitmapped frames; and a tracking module receiving the video data 
stream. (Col 3, lines 26-29); said image processing system comprising a video 
capture system for capturing a frame of said sequence of video frames to be viewed 
defining a captured video frame by showing a recording function for accepting the 
positions wherein the pixel signature (defined in the art as a local neighborhood around 
given pixel) most closely matches the image signature as the true positions of the 
image entity in the next frames. (Col 3, lines 43-46) and in FIG. 1 input data stream 15 
to tracking module 13 is a stream of successive bitmapped frames in a normalized 
resolution, required by the tracking module. (Col 5, lines 35-37) The authoring station 
can be based on virtually any sort of computer platform and operating system, and in a 
preferred embodiment, a PC station running MS Windows is used, in which case the 
input stream 16, regardless of protocol, is converted to a digital video format that can 
be interpreted and played back as a sequence of bitmapped frames. (Col 5, lines 37-43)
Furthermore, Rangan teaches a user interface for enabling a user to select one or
more pixel objects in said captured frame defining selected pixel objects. (Col 4, lines
11-35) Additionally, Rangan teaches a pixel object tracking system, which includes a
processor, which automatically tracks said selected pixel objects in other frames. (Col
3, lines 26-50) It should be noted that it is well known in the art that a computer
system would inherently contain a processor. Rangan also teaches said video linking
system generating one or more linked video files, separate from said video content 
(Col 6, lines 48-51, Col 10, lines 53-66) by explaining that when tracking element 29 (Fig. 2)
is positioned and activated over an image entity to be tracked, a signature table is 
created and stored (Col 8, lines 40-42) and upon tracking element 29 being activated 
the tracking module creates a table or list comprising pixel values associated with a 
target number and spatial arrangement of pixels associated with tracking element 29. 
(Col 7, lines 40-43). Although Rangan does not explicitly state the generation of video 
files, it is inherent to the invention that a table or list, which is created by the tracking 
module and subsequently stored, must implicitly require files for storage function. 
Lastly, Rangan teaches through additional editing processes, a moving region 
associated with the image entity in a display may be made to be interactive and 
identifiable to an end user. (Col 6, lines 55-57). Rangan further teaches user 
interaction with such an image entity during viewing of a video can be programmed to 
provide additional network-stored information about that entity to suitable customer
premises equipment (CPE) adapted to receive and display that information (Col 6, lines 
57-62) and such further information may be displayed, for example, as an overlay on 
the display of the dynamic video containing the subject image entity. (Col 6, lines 62- 
64) It should be noted Rangan further teaches providing one or more links to 
predetermined data objects for each pixel object. (Col 7 lines 25-52, Fig. 2) 
Nonetheless, Rangan fails to explicitly teach said video linking system generating one
or more linked video files separate from said video content, being configured to identify
the pixel objects by frame number and location within the frame. It would have been
obvious to one of ordinary skill in the art at the time the invention was made to
utilize user editing processes and programmable capabilities of stored information
about an image entity, as taught by Rangan, to identify the pixel objects by frame
number and location within a frame because it is well known in the art that stored
information about an image entity will include information about the image object's
frame number and location within the frame in order to properly retrieve and display
that information. Furthermore, these user programmable abilities allow advertisers,
product promoters, or the like to present information to end users based on user
interaction with an associated entity in a dynamic video display. (Col 6, lines 64-67)
Rangan also teaches linked video files configured to be synchronized with said video
content. (Col 6, lines 48-51 and Col 10, lines 53-56) However, Rangan fails to explicitly
teach video files played back independently while said video content is being played
back. This is what Rhoads teaches. (Col 7, lines 9-25) It should be noted that the
video content as taught by Rhoads is a video stream and the video linking files as
taught by Rhoads correspond to a short title (or number) and location of marked video
objects within the video stream containing these objects. It would have been obvious to
one of ordinary skill in the art at the time the invention was made to combine
the teachings of video files played back independently of video content into the system
of Rangan because an encoding process that facilitates user selection of video objects
on the decoding side can be achieved (Col 7, lines 9-11) and thus, information surviving
video capture can be realized. (Col 2, lines 19-22)
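
For illustration only, the limitation discussed above (a linked file, kept separate from
the video content, that identifies pixel objects by frame number and location within the
frame) can be pictured with the following minimal Python sketch. The record layout, the
names PixelObjectLink and find_link, the bounding-box representation, and the example
URL are all assumptions made for illustration; none of them is taken from the
application, Rangan, or Rhoads.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PixelObjectLink:
    """One entry of a hypothetical linked file stored apart from the video."""
    frame_number: int   # frame in which the pixel object appears
    x: int              # left edge of the object's bounding box, in pixels
    y: int              # top edge of the object's bounding box, in pixels
    width: int          # bounding-box width, in pixels
    height: int         # bounding-box height, in pixels
    data_object: str    # link to the associated data object (hypothetical URL)

    def contains(self, px: int, py: int) -> bool:
        """Return True if the point (px, py) lies inside the bounding box."""
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)


def find_link(links: List[PixelObjectLink], frame_number: int,
              px: int, py: int) -> Optional[PixelObjectLink]:
    """Look up the link for a selection at (px, py) in the given frame."""
    for link in links:
        if link.frame_number == frame_number and link.contains(px, py):
            return link
    return None


# The same object tracked across two consecutive frames (assumed values).
links = [
    PixelObjectLink(120, 40, 60, 32, 32, "https://example.com/item/1"),
    PixelObjectLink(121, 44, 60, 32, 32, "https://example.com/item/1"),
]
hit = find_link(links, 121, 50, 70)    # point inside the box in frame 121
miss = find_link(links, 121, 10, 10)   # point outside every box
```

Because the records carry only a frame number, a location, and a link, a player could in
principle consult them during playback without the video content itself being altered.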

Claims 2-3 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Rangan (6198833) in view of Rhoads (6411725) in further view of Vidovic (3878557). 

Consider claim 2, Rangan teaches a predetermined playback rate by showing in 
one preferred embodiment the subject video is displayed typically at 30 frames per 
second with a resolution of 352 by 240 pixels. (Col 5, lines 43-46) However, neither 
Rangan nor Rhoads explicitly teaches said video linking system samples said video 
content at a sample rate of less than said predetermined playback rate. This is what 
Vidovic teaches. Vidovic teaches a videotape recording apparatus, which shows color
frame pulses that are separated by 66 milliseconds and have a 15 Hz rate. (Col 23, lines
56-57 and Fig 17B) It would have been obvious to one of ordinary skill in the art at the
time the invention was made to combine a videotape recording apparatus
sampled at 15 Hz as taught by Vidovic with a video linking system displaying video at
30 frames per second as taught by Rangan in order to show the two possible phases
of the color frame reference pulses derived from the input color video signal (Col 23,
lines 57-60 and Fig 17) and thus, to assist in choosing the correct phase. (Col 7, lines
54-55)

Consider claim 3, Vidovic does not explicitly define a sample rate of three frames
per second. However, it would have been obvious to one of ordinary skill in the art at
the time the invention was made to lower a 15 Hz sampling rate for videotape
recording as taught by Vidovic to a 3 Hz sampling rate because it is well known in the art
that it is always practical to lower a sampling rate due to bandwidth or file size
limitations.
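
For illustration, the arithmetic relating the rates discussed for claims 2-3 can be
sketched as follows, assuming uniform sampling and an integer ratio between the playback
rate and the sample rate. The helper sampled_frame_indices is hypothetical and is not
drawn from Rangan, Rhoads, or Vidovic.

```python
def sampled_frame_indices(total_frames, playback_fps, sample_hz):
    """Return the indices of frames kept when sampling below the playback rate.

    Assumes playback_fps is an integer multiple of sample_hz.
    """
    if sample_hz <= 0 or sample_hz > playback_fps:
        raise ValueError("sample rate must be positive and not exceed playback rate")
    if playback_fps % sample_hz != 0:
        raise ValueError("this sketch assumes an integer ratio between the rates")
    step = playback_fps // sample_hz  # keep every step-th frame
    return list(range(0, total_frames, step))


# A 15 Hz rate corresponds to a period of 1000 / 15 ms, roughly 66.7 ms.
period_ms_15hz = 1000 / 15

# One second of 30 frames-per-second video sampled at 15 Hz and at 3 Hz.
one_second_at_15hz = sampled_frame_indices(30, 30, 15)  # every 2nd frame
one_second_at_3hz = sampled_frame_indices(30, 30, 3)    # every 10th frame
```

Under these assumptions, a 3 Hz sample rate keeps one frame in ten of video played at 30
frames per second, and a 15 Hz sample rate keeps one frame in two.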

Claims 4-5 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Rangan (6198833) in view of Rhoads (6411725) in further view of Toklu (6549643).

Regarding claim 4, the teachings of Rangan and Rhoads are given in the 
previous paragraphs of this Office Action. However, neither Rangan nor Rhoads 
explicitly teaches said video linking system is configured to identify segment breaks in 
said video content. This is what Toklu teaches. Toklu teaches that video summarization
methods typically include segmenting a video into an appropriate set of segments such
as video "shots" and selecting one or more key-frames from the shots. (Col 1, lines 34-37)
It should be noted that a key-frame is defined in the art to be a frame used to
indicate the beginning or end of a change made to the signal and therefore, an implied
segment break. It would have been obvious to one of ordinary skill in the art at the
time the invention was made to combine video summarization methods
configured to identify segment breaks as taught by Toklu with the image processing
system as taught by Rangan in order to reduce the number of images to one or more
key-frames to represent the content of a given shot (Col 1, lines 43-45) and thus, to
generate a video summary. (Col 1, line 33)

Application/Control Number: 10/786,777
Art Unit: 2628

Regarding claim 5, the teachings of Rangan and Rhoads are given in the
previous paragraphs of this Office Action. However, neither Rangan nor Rhoads
explicitly teaches said segment breaks are determined by determining the median
average pixel values for a series of frames and comparing changes in the pixel values
relative to the median average and indicating a segment break when the change in
pixel values represents at least a predetermined change relative to the median
average. This is what Toklu teaches. Toklu teaches determining median average pixel 
values for a series of frames by showing computing an average of an absolute
pixel-based intensity difference between consecutive frames in each segment, and for each
segment, computing a cumulative sum of the average of the absolute pixel-based 
intensity differences for the corresponding frames of the segment. (Col 3, lines 61-67) 
Toklu also teaches comparing changes in pixel values relative to median average by 
explaining selecting the first frame in each motion activity segment of a given segment 
frame if the cumulative sum of the average of the absolute pixel-based intensity 
differences for the frames of the given segment does not exceed a first predefined 
threshold. (Col 4, lines 1-5) Lastly, Toklu teaches indicating a segment break when the 
change in pixel values represents at least a predetermined change relative to the 
median average by showing selecting a predefined number of key-frames in the given 
segment uniformly, if the cumulative sum of the average of the absolute pixel-based 
intensity differences for the frames of the given segment exceeds the first predefined 
threshold. (Col 4, lines 5-9) It should be noted that a key-frame is defined in the art to 
be a frame used to indicate the beginning or end of a change made to the signal and 
therefore an implied segment break. It would have been obvious to one of ordinary
skill in the art at the time the invention was made to combine determining the
average pixel values for a series of frames, comparing changes in pixel values relative
to the average and indicating a segment break when the change in pixel values
represents at least a predetermined change relative to the median average as taught
by Toklu with the image processing system as taught by Rangan in order to measure a
temporal activity curve for dissimilarity based on frame differences (Col 3, lines 60-62)
and thus, to make possible the system and method for selecting key-frames from video
data. (Col 3, lines 51-59)
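
For illustration, the claim 5 limitation can be pictured with the following minimal Python
sketch, assuming each frame is given as a flat list of grayscale pixel intensities. The
sketch computes an average pixel value per frame, takes the median of those averages over
the series, and indicates a segment break where the frame-to-frame change is at least a
chosen fraction of that median. The function names, the grayscale assumption, and the
threshold value are assumptions; the sketch is neither the applicant's disclosed method
nor the algorithm of Toklu, which accumulates average absolute pixel-based intensity
differences per segment.

```python
from statistics import mean, median


def frame_averages(frames):
    """Average pixel value of each frame (frames are flat lists of intensities)."""
    return [mean(frame) for frame in frames]


def segment_breaks(frames, relative_threshold=0.25):
    """Return indices of frames that begin a new segment.

    A break is indicated at frame i when the change in average pixel value
    from frame i-1 to frame i is at least relative_threshold times the
    median of the per-frame averages (an assumed, illustrative criterion).
    """
    averages = frame_averages(frames)
    if len(averages) < 2:
        return []
    med = median(averages)
    if med == 0:
        return []  # avoid a meaningless relative comparison on all-black video
    breaks = []
    for i in range(1, len(averages)):
        change = abs(averages[i] - averages[i - 1])
        if change >= relative_threshold * med:
            breaks.append(i)
    return breaks


# Five tiny 4-pixel frames: three dark frames followed by two bright frames.
frames = [
    [100, 102, 98, 100],
    [101, 101, 99, 99],
    [100, 100, 100, 100],
    [180, 182, 178, 180],
    [181, 179, 180, 180],
]
breaks = segment_breaks(frames)  # a break is indicated at frame index 3
```

In this example the per-frame averages are 100, 100, 100, 180, and 180, so the median is
100 and only the jump of 80 between the third and fourth frames meets the assumed
threshold of 25.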



Any inquiry concerning this communication or earlier communications from the
examiner should be directed to Kevin K Xu whose telephone number is 571-272-7747. 
The examiner can normally be reached on Monday-Friday from 9 AM - 5:30 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Mark Zimmerman, can be reached at 571-272-7653.

Information regarding the status of an application may be obtained from Patent 
Application Information Retrieval (PAIR) system. Status information for published 
applications may be obtained from either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through Private PAIR only. For 
more information about the PAIR system, see http://pair-direct.uspto.gov . Should you 
have questions on access to the Private PAIR system, contact the Electronic Business 
Center (EBC) at 866-217-9197 (toll-free).